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Abstract 

We prove that the generating polynomials of partitions of an n- 
element set into non-singleton blocks, counted by the number of blocks, 
have real roots only. We apply this information to find the most likely 
number of blocks. As another application of the real zeros result, we 
prove that the number of blocks is normally distributed in such parti- 
tions. We present a quick way to prove the corresponding statement 
for cycles of permutations in which each cycle is longer than a given 
integer r. 



1 Introduction 

A partition of the set [n] = {1, 2, • • • , n} is a set of blocks disjoint blocks 
i3i,i?2, • ■ ■ ) -Sfc so that U*L]^-Bj = [n]. The number of partitions of [n] into 
k blocks is denoted by S{n, k) and is cahed a Stirling number of the second 
kind. 

Similarly, the number of permutations of length n with exactly k cycles 
is denoted by c{n,k), and is cahed a signless Stirling number of the first 
kind. See any textbook on Introductory Combinatorics, such as [2] or [3] for 
the relevant definitions, or basic facts, on Stirling numbers. 

The "horizontal" generating functions, or generating polynomials, of 
Stirling numbers have many interesting properties. Let n be a fixed pos- 
itive integer. Then it is well-known (see [2] or [3] for instance) that 

n 

Cn{x) = ^ c(n, k)x'' =x{x + l)---{x + n-l). (1) 
k=l 
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In particular, the roots of the generating polynomial C„(x) are all real (in- 
deed, they are the integers 0, —1, —2, • • • , — (n — 1)). 

Similarly, it is known (see pi)|, page 20, for instance) that for any fixed 
positive integer n, the roots of the generating polynomial 

n 
k=l 

are all real, though they are not nearly as easy to describe as those of Cn{x). 

Rodney Canfield [8] (in the case of r = 1) and Francesco Brenti [5] (in 
the general case) have generalized ([I]) as follows. Let dr{n, k) be the number 
of permutations of length n that have k cycles, each longer than r. Such 
permutations are sometimes called r-derangements. Then the generating 
polynomial 

dn,r{^) = X] dr{n, k)x^ (2) 
k>l 

has real roots only. Present author [1] proved that for any given positive 
integer constant m, there exists a positive number so that if n > A^, then 
one of these roots will be very close to —1, one will be very close to —2, and 
so on, with one being very close to — m, to close the sequence of m roots 
being very close to consecutive negative integers. 

In this paper, we consider the analogue problem for set partitions. Let 
D{n,k) be the number of partitions of [n] into k blocks, each consisting of 
more than one element. We are going to prove that the generating polyno- 
mial 

Dn{x)=Y,D{n,k)x'' (3) 

fc>i 

has real roots only. We will then use this information to determine the 
location of the largest coefficient (s) of Dn{x). We also prove that the num- 
ber of blocks is normally distributed. Finally, we use our methods on r- 
derangements, and prove the more general result that for any fixed r, the 
distribution of the number of cycles of r-derangements of length n converges 
to a normal distribution. 

Note that the fact that the two kinds of Stirling numbers behave in 
the same way under this generalization is not completely expected. Indeed, 
while 1/e of all permutations of length n have no cycles of length 1, and 
in general, a constant factor of permutations of length n have no cycles of 
length r or less, the corresponding statement is not true for set partitions. 
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Indeed, almost all partitions of [n] contain a singleton block as we show in 
Section 3.1. However, as this paper proves, the real zeros property survives. 

Finally, we mention that the vertical generating functions (minimal block 
or cycle size is fixed, n varies) of permutations and set partitions have been 
studied in [S]. 

2 The Proof of The Real Zeros Property 

We start by a recurrence relation satisfied by the numbers D(n, k) of parti- 
tions of [n] into k blocks, each block consisting of more than one element. 
It is straightforward to see that 

D{n, k) = kD{n -l,k) + {n- l)D{n -l,k- 1). (4) 

Indeed, the first term of the right-hand side counts partitions of [n] into 
blocks larger than one in which the element n is in a block larger than two, 
and the second term of the right-hand side counts those in which n is in a 
block of size exactly two. 

Let Dn{x) = Y.k>iD{n,k)x^. Then (g]) yields 

Dn{x) = X + (n - l)Dr,-2{x)) . (5) 

Note that Di{x) = 0, and Dn{x) = x if 2 < n < 4. So the first non- 
trivial polynomial Dn{x) occurs when n = 4, and then 1)4(2;) = 2>x^ + x. In 
the next non-trivial case of n = 5, we get -D5 = lOx^ + x. 

Theorem 1 Let n > 2. Then the polynomial Dn{x) has real roots only. All 
these roots are simple and non-positive. Furthermore, the roots of Dn{x) and 
Dn-i{x) are interlacing in the following sense. If Dn{x) and Dn-i{x) are 
both of degree d, and their roots are, respectively, = xq > xi > ■ ■ ■ > Xd-i, 
and = 2/0 > yi > • • • > Ud-i, then 

> xi > yi > X2 > y2 > ■ ■ ■ > xd-i > yd-i, (6) 

while if Dn{x) is of degree d+1 and Dn-i{x) is of degree d, and their roots 
are, respectively, Q = xq > xi > ■ ■ ■ > Xd, and = yo > yi > • ■ ■ > yd-i, 
then 

> xi > yi > X2 > y2 > ■ ■ ■ > Xd-i > yd-i > Xd- (7) 

Proof: We prove our statements by induction on n. For n < 4, the state- 
ments are true. Now assume that the statement is true for n — 1, and let us 
prove it for n. Let = yo > yi > • • • > yd-i be the roots of D„_i(x). 
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First we claim that if > x > yi, then Dn-i{x) < 0, that is, the 
polynomial is negative between its two largest roots. Indeed, S'^_i{0) = 
Sr{n — 1,1) = 1, so Dn-iix)' > in a neighborhood of 0. This implies that 
in that neighborhood, Dn-i{x) is monotone increasing. As L'„_i(0) = 0, 
this implies our claim. 

Now consider ([5]) at x = yi. We claim that at that root, we have both 
< and -Dn-2(yi) < 0. The latter is a direct consequence of 
the previous paragraph and the induction hypothesis. Indeed, the induction 
hypothesis shows that if zi is the largest negative root of -Dn-2i then zi < 
Hi < 0, and the previous paragraph, applied to Dn-2 instead of Dn-i shows 
that Dn-2{x) < if X G (zi,0). So in particular Dn-2{yi) < 0. The 
former follows from Rolle's theorem. Indeed, by Rolle's theorem, between 
two consecutive roots of Dn-i, there has to be a root of As D^-i has 

simple roots only, say d of them, must have d — 1 simple roots, and 

therefore, by the pigeon-hole principle, there must be exactly one of them 
between any two consecutive roots of D^-i- In particular, there is exactly 
one root of D'^__^ between and yi, so the sign of D'^_'^{yi) is the opposite 
of the sign of D'„_]^(0) = 1, that is, it is negative. 

So when x = yi, the argument of the previous paragraph shows that the 
right-hand side of ([5]) is the product of the negative real number yi, and the 
negative real number + {n — l)Z)„_2(?/i). Therefore, the left-hand 

side must be positive, that is, Dn{yi) > 0. As Dn{x) < in a neighborhood 
of 0, this shows that Dn has a root in the interval (yi,0). 

More generally, we claim that Dn{x) has a root in the interval (yj+i, yj). 
For this, it suffices to show that Dn{yi) and Dn{yi+i) have opposite signs. 
This will follow by ([5]) if we can show all of the following. 

(i) D'^__^{yi) and D'^^^{yi+i) have opposite signs, 

(ii) Dn~2{yi) and Z)„_2(y«+i) have opposite signs, and 

(iii) D'^_^{yi) and Dn-2{yi) have equal signs. 

Just as before, (i) follows from Rolle's theorem, and (ii) follows from the in- 
duction hypothesis. In order to see (iii), note that by Rolle's theorem, D'^_^ 
changes signs exactly i times in (yi,0), while by the induction hypothesis, 
Dn-2 changes signs i — 1 times in (yi,0). We have seen at the beginning of 
this proof that in a small neighborhood of 0, D'^_^ is positive, while Dn-2 
is negative, so (iii) follows. Therefore, by dS]), Dn{yi) and Dn{yi+i) have 
opposite signs, and so Dn{x) has a root in {yi^i,yi). 

Note that this argument does not directly prove that Dn has exactly one 
root in (yj+i, y^), but it does prove that it has an odd number of roots in each 
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such interval. Indeed, D„ has opposite signs at the endpoints of 
it has an odd number of sign changes, and so, an odd number of roots on 
that interval. As the total number of roots of Dn is at most one larger than 
that of Dn^i, it follows that Z)„ has indeed exactly one root in each interval 

(2/i+i,yi)- 

The above argument completes the proof of the theorem for odd n. When 
n is even, then Dn is of degree d + 1, while Dn-i is of degree d. In that 
case, we still have to show that Dn has a root in the interval (— oo,yrf_i). 
However, this follows from the previous paragraph since the last root Xd of 
Dn must be negative, and cannot be in any of the intervals (yi+i,yj). O 

It follows from Theorem [1] that both the sequence D4, Dq,Ds, • • • , and 
the sequence Ds, Dy, Dg, • • • are Sturm sequences. The interested reader 
should consult |15j for the definition and properties of Sturm sequences. 



3 Applications of The Real Zeros Property 
3.1 Locating peaks 

If a polynomial ^^^=1 ^kX^ with positive coefficients has real roots only, then 
it is known [3] that the sequence 61 , 62 , ■ " " &n of its coefficients is strongly log- 
concave. That is, for all indices 2 < j < n — 1, the inequality 

holds. In other words, the ratio bj^i/bj is strictly decreasing with j, and 
therefore there is at most one index j so that bj^i/bj = 1. Thus the sequence 
61, 62, • • • 6„ has either one peak, or two consecutive peaks. 

A useful tool in finding the location of this peak is the following theorem 
of Darroch. 

Theorem 2 110} Let A{x) = Ylk=i ^kX^ be a polynomial that has real roots 
only that satisfies A{\) > 0. Let m be a peak for the sequence of the coeffi- 
cients of Aix). Let n = ^'(l)M(l) = S^J-^. Then we have 

l/i — m| < 1. 

Note that in a combinatorial setup, /i is the average value of the statistic 
counted by the generating polynomial A{x). For instance, if A{x) = Sn{x), 
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then // is the average number of blocks in a randomly selected partition of 

N- 

There is a very extensive list of results on the peak (or two peaks) of 
the sequence S{n, 1), 5(n, 2), • • • ,S{n, n) of Stirling numbers of the second 
kind. See [S] for a brief history of this topic and the relevant references. In 
particular [12], if K{n) denotes the index of this peak (or the one that comes 
first, if there are two of them), then K{n) ^ n/logn . More precisely, let r 
be the unique positive root of the equation 

re*" = n. (8) 

Then, for n sufficiently large, K{n) is one of the two integers that are closest 
to e*" — 1. In view of Theorem [2l one way to approach this problem is by 
computing the average number of blocks in a randomly selected partition of 
[n]. 

Now that we have proved that the generating polynomial Djiix^ — 
D{n, k)x^ has real roots only, it is natural to ask how much of the 
long list of results on Stirling numbers can be generalized to the num- 
bers D(n,k). In this paper, we will show a quick way of estimating the 
average number of blocks in a partition of [n] with no singleton blocks, 
and so, by Darroch's theorem, the location of the peak(s) in the sequence 
D{n, l),D(n,2), • • • ,D{n, [n/2j). For shortness, let us introduce the nota- 
tion D{n) = Y.kD{n,k). 

Proposition 1 For all positive integers n >2, the average number E{n) of 
Mocks in a partition of [n] with no singleton blocks is 

D[n) D[n) 

Proof: The total number of blocks in all partitions counted by D[n) is 
clearly Ylik>i kD{n, k). On the other hand, 

D{n +l) = Y^ kD{n, k) + {n - l)D{n - 1), 

k>l 

since the first term of the right-hand side counts partitions in which the 
element n + 1 is in a block of size three or more, and the second term 
counts partitions in which the element n -|- 1 is in a block of size two. So 
kD{n, k) = D{n+ 1) — (n — \)D{n — 1), and the statement follows. O 
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Let B{n) denote the number of all partitions of [n]. This number is often 
called a Bell number. There are numerous precise results on the asymptotics 
of the Bell numbers. We will only need the following fact [llj . 

logS(n) = n (logn — log log n + 0(1)) . (10) 

It is well-known that B{n) = D{n) + D{n + 1). A simple bijective 
proof is obtained by using the identity map on partitions counted by D{n), 
and removing the element n + 1 and turning each element that shared a 
block with n + 1 into a singleton block in partitions counted by L'(n + 1). 
In particular, this yields that B(n) > D{n + 1), and so oo since 

^D(n)^ ~^ C^*^ latter, note that for any h, there exists an so 

that if n > A^, then almost all partitions counted by D{n) have more than 
h blocks.) Therefore, D(n + 1) ~ B{n), and Q implies 

B{n) nB{n-2) 

E{n) = — J- -^~logn. 



B{n-l) ' ' B{n-l) 

An argument that is very similar to (but even simpler than) the proof of 
Proposition [1] shows that the average number of blocks in an unrestricted 
partition of [n] is '^^^^"^^ — 1. Using Theorem [2l this leads to the following. 
Recall that K{n) is the location of the (leftmost) peak of the sequence 
S{n, 1), 5(n, 2), • • • , 5(n, n). 

Theorem 3 Let n he a fixed positive integer, and let k{n) he the index 
for which D{n,k{n)) is maximal, or if there are two such indices, then the 
smaller one. Then K{n — 1) — k{n) ~ logn. 

Note that this result is in line with what we would intuitively expect, 
since a randomly selected partition of [n] has, on average, about blocks, 
about logn of which are singleton blocks, as can be seen by computing the 
probability that {1} is a singleton block by (jlOp . then using the linearity of 
expectation. 



3.2 Proving Normal Behavior 

Another frequent application of the real zeros property is proving that the 
coefficients of the polynomial at hand are normally distributed. Our tool 
in doing so will be a theorem of Ed Bender, proved in [IJ. The reader 
may also want to consult the survey paper of Andrzej Rucinski for this and 
other methods to prove normality in combinatorics. Let A„ be a random 
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variable, and let an{k) be a triangular array of non- negative real numbers, 
n = 1, 2, • • • , and 1 < k < m[n) so that 



^etgn{x) = Y.kSPn{k)x^ 

We need to introduce some notation for transforms of the random vari- 
able Z. Let Z = Z - E{Z), let Z = Z/y^Var(Z), and let Z„ N{0,1) 
mean that Z„ converges in distribution to the standard normal variable. 

Now we are ready to state Bender's theorem. 

Theorem 4 Let Xn and gn{x) he as above. If gn{x) has real roots only, 
and 

Cn = \J Var{Xn) oo, 

then Xn N{0,1). 

See [6] for related results. 

We are going to use this theorem in the special case when X„ is the 
number of blocks in a randomly selected partition of [n] with no singleton 
blocks, that is, an{k) = D{n,k). Then pn{k) = D{n,k)/D{n) for all k, so 
Theorem [1] implies that gn{x) = j^j^ X]a:>i ^(^i ^)^'^ ^^^^ roots only. 

There remains the task of showing that (t„ = y^Var(X„) — > oo. We will 
accomplish that following the lead of the classic paper of Harper [T^], in 
which he proved that the number of blocks of unrestricted partitions of [n] 
is normally distributed. We will have a little bit more details to handle, but 
the crucial ideas of Harper's proof survive. The first step is the following 
proposition. 

Proposition 2 For all positive integers n > 2, we have 

L"/2J 

D{n, k)k^ = D{n + 2) - (2n + l)D{n) - nD{n - 1) - n(n - l)D{n - 2). 

k=l 

Proof: Take a partition of [n] into k blocks, none of which are singletons. 
Insert the entries n + 1 and n + 2 in any of these blocks in k'^ ways. Summing 
over k, this shows that there are "^^i^ D^n, k)k'^ partitions of [n + 2] into 
blocks, none of which are singletons, so that n + 1 and n + 2 are in blocks 
that contain at least two elements that are at most n. How many other 



8 



partitions of [n + 2] are there without singleton blocks? There are D{n) 
such partitions in which {n + l,n + 2} is a block. By the sieve formula, 
there are 2nD{n) — n{n — \)D{n — 2) such partitions in which there is either 
a block of the form {z, n+ 1} for some i < n, or a block of the form {j, n + 2}, 
or both. Finally, there are nD[n — \) such partitions in which {i,n + l,n + 2} 
is a block for some i < n. O 



Lemma 1 Let Xn be the number of blocks in a randomly selected partition 
of [n] with no singleton blocks. Then Var(Xn) = Cn ^ oo. 

Proof: We use the identity al = E{X^) - E{Xnf. Note that E{X^) = 

— ; and the numerator of this fraction was computed in Propo- 
sition [2j Furthermore, E{Xn) = E{n) was computed in ([9]). That is, 

2^ D{n + 2) nD{n-l) n{n-l)D{n-2) 
D[n) D{n) D{n) 



and 



D{ny D{nY D{ny 



Therefore, 



^_, D{n + 2) D{n+ir 
" ' Din) D(n)^ ' 



+2n 



D{n - l)D{n+l) 



D{nY 

n'^D{n - 1)2 + nD{n)D{n - 1) + n{n - l)D{n)D{n - 2) 



1. 



Z)(n)2 

We will now consider the first three terms of the right-hand side one by 
one. It is proved in Lemma 2 of [12] that (^—^^^ — "~B(n^) ~^ fact, 
it is proved there that 

B(n + 2) B(n + lf\ n 

+ o(l), 



B{n) B{nY J r(r + 1) 

where r is defined as in ([8]). As we mentioned in the paragraph preced- 
ing ([8]), we know that e^ ~ n/logn, so r ~ logn — log log n, and so 
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{^Wr ~ ^W^) ~ n/log^n. As we know that B(n) ~ D{n + 1), it 

follows that (^ ^^(^f - ^D{n)^ ) ~ n. 

It is proved in [7] that the sequence D{2), D{3), ■ ■ ■ is log-convex, that 
is, D{n — l)D{n + 1) > D{n)'^ for all n > 3. Therefore, the second term is 
non-negative. 

Finally, it is routine to prove, using the fact that j)^^"^ ~ %^{^2) ^® 
roughly n / log n, that the third term is of order c log'^ n. Therefore, an ^ oo 
as claimed. O 

We have seen that both conditions of Theorem S] hold. Therefore, we 
have proved the following theorem. 

Theorem 5 Let X„ denote the number of blocks in a randomly selected 
partition of [n] with no singleton blocks. Then N{0, 1). 

3.3 The normahty of the number of cycles of r-derangements 

Let r be a fixed non-negative integer, and let Yn be the number of cycles in a 
randomly selected r-derangement of length n. Recall from the introduction 
that an r-derangement is a permutation in which each cycle is longer than 
r. Theorem [Hand a recent result of present author provide a very quick way 
of proving that Yn N[0, 1]. 

Recall from the introduction that dr{n, k) is the number r-derangements 
of length n that have k cycles. Then it is known [5j that the generating 
polynomial dn,r{x) = X]fe>i dr{n^ k.)x'^ has real roots only. So one of the two 
conditions of Theorem S] holds for the variables Yn ■ We need to show that 
the other one holds as well, that is, that ^JVaxiYn) — »■ oo. We need the 
following tools. 

Proposition 3 Let Xn and gn{x) be defined as in Theorem^ If gn{x) 
has real roots only, then 

m{n) 

where — Ai, — A2, • • • are the roots of gn{x)- 

Proof: Define independent random variables X„^i, X„^2; ■ ■ ■ , ^n,m{n) so that 
they are all 0-1 variables, and P{Xnj = 0) = j^j-, and P{Xnj = 1) = 
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1+A 



1 



. It is then straightforward to verify that ^ 



im(n) 
,j=l 



X, 



Xn by checking 



that g{n) is the product of the probabihty generating functions of the X, 



Therefore, Var(X„) = X]j=i Var(X„j) = Yljli (i+x )'^ claimed. O 



In other words, if we know the roots of the generating function gn{x), 
then we can compute of Var(Xjj). Note that we do not need to compute 
Var(X„); we only need to show that Var(X„) oo. To that end, we do 
not necessarily need to know all roots Xj; sometimes it is enough to know 
just some of them. The following theorem provides sufficient information 
about the roots in the special case when Xn = Y^, and so gn{x) is a constant 
multiple of dn,r{x). 

Theorem 6 /^/ For every negative integer —t, and every e > 0, there exists 
a positive integer N so that if n > N, then dn,r{x) has a root xt satisfying 
\ — t — xt\ < e. 

Therefore, for any positive integer t, and any e > 0, we can find an N so 
that if n > A^, then 



and therefore, Var(y„) — > oo. So we have verified that both conditions of 
Theorem m hold, and therefore, we have proved the following theorem. 

Theorem 7 Let r he a fixed non-negative integer, and let Yn be the number 
of cycles in a randomly selected r- derangement of length n. Then Yn 
N[0,1]. 

4 Further Directions 

It is natural to ask whether Theorem [1] can be generalized to partitions with 
all blocks larger than r, where r is a given positive integer. That would 
parallel the result ([2]) of Brenti [5] on permutations. 

A consequence of the fact that the polynomials Dn{x) have real zeros is 
that for any fixed n, the sequence D{n, l),L>(n,2), • • • ,D{n, [n/rj) is log- 
concave. In [H], Bruce Sagan provides a proof for the special case of r = 0, 
that is, that of the classic Stirling numbers of the second kind. However, 
his injection proving that result does not preserve the no-singleton-block 
property. Now that we know that the statement is true, it is natural to ask 
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for an injective proof. Similarly, as we know that dn,r{x) has real roots only 
(see ([I]) ) , we can ask for a combinatorial proof for the fact that the sequence 
dr{n,l),dr{n,2), ■ ■ ■ ,dr{n, [n/rj) is log-concave. 
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